hep-ph/0102234 CERN-TH/2001-46 IFUP-TH/2001-6 SNS-PH/01-3 



Frequentist analyses of solar neutrino data 

(updated including the first CC data from SNO^) 
(updated including the NC and day/night data from SNO^) 

(updated including the first data from KamLAND^) 
(updated including SNO data with enhanced NC sensitivity'') 
(updated including final SNO 'salt' data and 2004 KamLAND data^) 



Paolo Creminelli, Giovanni Signorelli 

Scuola Normale Superiore and INFN, Sezione di Pisa, Italy 
Alessandro Strumia 

CERN, Geneva, Switzerland and Dipartimento di Fisica dell'Universitd di Pisa and INFN 

Abstract 

The solar neutrino data are analysed in a frequentist framework, using the Crow- 
Gardner and Feldman-Cousins prescriptions for the construction of confidence re- 
gions. Including in the fit only the total rates measured by the various experiments, 
both methods give results similar to the commonly used Ax^-cut approximation. 
When fitting the full data set, the Ax^-cut still gives a good approximation of the 
Feldman-Cousins regions. However, a careful statistical analysis significantly reduces 
the goodness-of-fit of the SMA and LOW solutions. 

In the addenda we discuss the implications of the latest KamLAND, SNO and SK 
data. 

1 Introduction 

The solar neutrino anomaly is an old but still controversial problem, in which many experimental data ^EIEIE] 
and theoretical ingredients El El El El have to be merged to give predictions for the oscillation parameters and 
to rule out other non-standard explanation of the anomaly. A correct statistical treatment is a necessary step 
of the analysis. The starting point for interpreting the results of an experiment is the fact that one knows the 
probability distribution p(data|theory) for obtaining a set of data under the assumption that a given theory is 
true. In the case of the solar neutrino anomaly (at least in its simplest version), we know p{Ri\Am? , 9), where 
Ri are the three neutrino rates measured in Chlorine, Gallium and SK experiments, which should be used to 
infer the values of the theoretical parameters Am^ and 6. 

This can be done according to two conceptually very different approaches , each one with unsatisfactory 
aspects. 

^The addendum at pages [1111131 f section 6) is not present in the published version of this paper. 
■^The addendum at pages [T!^17l (section 7) is not present in the published version of this paper. 
^The addendum at pages IT9ll21l ^section 8) is not present in the published version of this paper. 
*The addendum at pages 1^^231 (section 9) is not present in the published version of this paper. 
^The addendum at page I24l (section 10) is not present in the published version of this paper. 
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• The Bayesian approach employs a probability distribution p{Am'^,9) to summarize our knowledge 
of the parameters of the theory. According to elementary properties of probability, this probability gets 
updated by the inclusion of the results of a new experiment as p(Am^, 6\R) oc Am^, 6)p{Am'^ , 6) . The 
drawback is that one needs to choose some 'prior' p{Am^,9) to start with, and the final result depends 
on this choice until experiments are sufficiently precise. At the moment, solar neutrino fits give multiple 
distinct solutions so that p{Am^ , 6\R) still contains arbitrary order 1 factors. The advantage is its extreme 
simplicity: the laws of probability dictate what to do in any situation. 

• The frequentist approach refuses the concept of probability of theoretical parameters. The Neyman 
construction allows us to build range of parameters for any possible outcome of an experiment with 
the property that 90% (or whatever) of such ranges contain the true value. However this procedure 
is not univocal and the resulting regions can be quite different. For example the Crow-Gardner |12| 
procedure gives smaller regions in presence of unlikely statistical fluctuations in the measured outcome of 
the experiment, while the Feldman-Cousins procedure gives ranges of roughly the same size for all 
possible outcomes. 

In simple cases when p(data|theory) is a Gaussian function of all its arguments (data and parameters, with no 
physical constraints on them), the Bayesian approach (using a flat prior p) and the frequentist approach (using 
the Feldman-Cousins method) are numerically equivalent to the commonly employed Ax^-cut approximation. 

When fitting solar neutrino data one has to be careful because: 

(1) p{Ri\Am^,6) is a highly non-Gaussian function of Arn^, 9: in fact one finds a few separate best-fit solutions 
(usually named 'LMA', 'SMA', 'LOW, 'VO') while a Gaussian would have only one peak. This is the 
problem that we will address in this paper. 

(2) p{Ri\A'm^ ,9) is not perfectly Gaussian as a function of Ri. Assuming a Gaussian uncertainty on the 
detection cross sections cr and on the solar fluxes $, one does not obtain a Gaussian uncertainty on the 
rates i? ~ cr • $. In principle this is true; in practice the errors on a and $ are sufficiently small that their 
product is also almost Gaussian, up to very good accuracy. 

Such issues have been studied in finding that (1) apparently has a dramatic effect (see fig. 3 of ^J: LMA 
and LOW merge in a single region), while (2) has a negligible effect (see table II of J^). We will ignore (2) 
and we therefore write the probability density function (pdf) for all the n solar neutrino data Xi as 



The predicted values x^^ and the covariance matrix depend on Am^ and 9. The covariance matrix con- 
tains both theoretical and experimental errors, statistical and systematic, added in quadrature. This is the 
standard procedure, which can be justified by applying the Neyman construction in a Bayesian framework (i.e. 
by describing theoretical and systematic uncertainties using a probability distribution). A strict frequentist 
framework employs a definition of probability that makes it unclear how to deal with systematic and theoretical 
uncertainties. 

Using the analytical properties of Gaussians enormously simplifies the computation: we will not need lengthy 
and obscure computer calculations. The probability p is computed as described in the appendix. We will study 
oscillations among the three active neutrinos in a two fiavour setup. We could study much more general cases, 
but experiments indicate that this seems to be the relevant case^. Am^ = — > is the squared mass 
difference relevant to solar neutrinos, and < 9 < 7r/2 is the corresponding mixing angle. 

®In a 3 f framework, the Ue can also oscillate at the atmospheric Am^. The CHOOZ bound 1151 implies that the relative mixing 
angle is so small that it can only have a minor effect on solar neutrinos. The LSND anomaly 1161 motivates models with a fourth 
sterile neutrino. However LSND is significantly constrained directly by Bugcy 1171 and Karmen 1181 . and indirectly by SK 1^1191 
that disfavours a significant sterile contribution to both the atmospheric and solar anomalies. These indirect bounds can be evaded 
in models with many sterile neutrinos. 
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In section 12 we fit tlie data about the total rates using the Crow-Gardner and Feldman-Cousins construc- 
tions, which are compared with the commonly used A^^-cut approximation. We do not find dramatic differences 
(see fig. A fit based on the Ax^ approximation does not miss any relevant physical issue. In sectionOwe 
include in the fit the SK spectral and day/night data. We now find more marked differences between the var- 
ious methods for building Neyman's confidence regions (sec fig. EJ. In section 0] we show that the well-known 
statement that LMA, LOW and SMA presently give a good fit is based on an inappropriate statistical test, and 
we recompute the goodness-of-fit of the various solutions (see table 1). Our conclusions are drawn in sectional 



2 Different frequentist analyses: rates only 

We want to compare exact and approximate methods to compute confidence regions. To begin with, we consider 
only the total rates measured at Homestake, SuperKamiokande and the weighted sum of the two Gallium 
experiments: GALLEX-GNO and SAGE. All fits done so far (except use the approximated method based 
on the Ax^-cut; this approximation will be compared with two frequentist constructions: the Crow-Gardner |12| 
and Feldman-Cousins ^3 methods. 

The use of the A^^-cut is based on the well-known likelihood ratio theorem |2()j . which states: given a 
conditional pdf p{x\m) (x is the data vector and m are the parameters we want to estimate) with a range for 
X independent from the value of m, the quantity 

X{x; m = 2 log — (2) 

V p{x\m) J 

is distributed as a with dim(m) degrees of freedom (dof), independently from the value of m, in the limit 
dim(a;) oo. With the pdf of eq. this leads to 

X{x;Am^,9) = - xLt + log det cr^ _ log det cr^,,t , (3) 

where x^ is the usual sum in the exponent of eq. ct^ is the covariance matrix and the subscript 'best' indicates 
that the corresponding quantity must be evaluated at the value of Am^ , 9 that maximizes the probability for 
the given measured x. In the limit of infinite data, A is distributed as a x^ with two degrees of freedom (the two 
parameters we are studying: Am^ and 6. If lO^'^eV^ <Am? < 10~''eV^ one can obtain poor fits with energy 
independent survival probability. In this case the experimental results only depend on the single parameter 9). 

The simplest way to construct confidence regions using this asymptotic property is to include all values 
of (Am^,6') for which A is less than a critical value, which can be obtained from the x^ distribution tables. 
Neglecting the In det ct^ term — which is not a constant — gives the well-known approximate rule 

Ax' < /?, (4) 

where (3 depends on the confidence level (CL) wc want to quote. This is the method most analyses use to obtain 
the confidence regions. It is twofold approximate: it neglects the log det ct^ dependence and, since the number 
of solar data is finite, does not ensure the correct CL. 

The correct frequentist construction of the confidence regions is a well-known procedure: for any point m in 
the parameter space (Ato^ and 9, in our case) one has to arbitrarily choose a region A{m) in the space {a;} of 
the data (the three rates, in our case) which contains the CL% of the probability. The knowledge of the pdf 
allows to do this. The confidence region B{x) is given by all the points in the parameter space that contains 
the measured value of the experimental data in their acceptance region: 

Bix) {m\x e Aim)}. (5) 

It is easy to realize that, taken any true value for the parameters, the quoted region contains this true value in 
the CL% of the cases. 
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Figure 1: Confidence regions at 90% (left) and 99% (right) CL obtained from the measured solar rates using 
three different methods. The smallest regions (continuous line) are obtained with the Ax^ approximation; they 
are surrounded by the Feldman-Cousins regions (dashed line). The largest regions (dotted line) are obtained 
with the Crow-Gardner procedure. 



The arbitrariness in the choice of the acceptance region A{m) can be fixed by choosing a particular ordering 
in the data space: the construction of A{m) is made adding a;-cells in that order until the requested probability 
(coverage) is reached. Crow and Gardner (CG) proposed |^ an ordering based on p{x\m), while Fcldman and 
Cousins (FC) proposed ^31^ an ordering based on the likelihood ratio 

p{x\m)/p{x\mbcst{x)). (6) 

This means that the 'priority' of a point x for fixed m is given by its probability relative to the probability 
obtained with the parameters set at the best-fit value mbest{x) corresponding to x. 



2.1 Feldman— Cousins fit 

The FC ordering requires cumbersome numerical computations, but guarantees that the FC acceptance regions 
share the nice properties of the approximate Ax^-cut method. The FC ordering disregards the statistical 
fluctuations with no information on the parameters. If the measured rates arc unlikely for any value of the 
parameters, the FC procedure "renormalizes" the probability when determining the confidence regions. It is 
easy to see that the FC acceptance regions are never empty for any choice of the confidence level: every point 
in the data space belongs at least to the FC acceptance region of the parameter mbost that maximizes its 
probability^. 

''The Feldman-Cousins method supplements Neyman's prescription in a way that solves some of the problems that characterize 
other methods. Some unwanted and strange properties are however still present, and the proposed way out 1211 goes beyond 
Neyman's construction, loading, always in a completely frequontist approach, to the definition of a now quantity called Strong 
Confidence Level. 

*The Feldman-Cousins procedure gives no empty confidence regions if all the points with the same likelihood ratio are included 
in the acceptance region for given parameters, even after the given probability is reached: this can give a certain overcovering, but 
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Figure 2: Approximate shapes of the acceptance regions for two near parameter points in the FC case (left) 
and in the CG case (right). In the first case the regions are stretched perpendicularly to the theoretical surface, 
while in the second they are ellipsoidal. A parameter point is accepted if the measured experimental point (black 
dot in the figures) lies inside its acceptance region. 



The FC procedure has many pomts in common with the approximate Ax^-cut method: looking at eq.s 
Q at fixed m we sec that the inequahty \ < (3 chooses A{m) with the same ordering as the FC method. The 
only difference is that the limit [3 is chosen, using the asymptotic distribution of A, independent of m, while 
the exact method gives a limit P{m) that depends on the oscillation parameters m. 

It is easy to check that the A^^-cut is exactly equivalent to the FC construction if the pdf is Gaussian with 
constant covariance matrix and with theoretical rates that depend linearly on the parameters (by 'theoretical 
rates' we indicate the most likely value of the rates, for given values of Ato^ and 9). In the linear approximation 
the theoretical rates, obtained varying the two parameters Am^ and 9, form a plane in the three dimensional 
space of the rates^. The comparison with this linear approximation helps us to understand whether the A^^-cut 
is a good approximation. Two different behaviors are possible for a given m: 

• The value of P{m) given by the FC procedure is smaller than the approximated one derived by the A^^- 
cut. This happens, for example, if we measure values of the parameters near the edges of the parameter 
space. The Ax^ approximation assumes an infinite hyperplane of theoretical rates: the points of the data 
space that have maximal probability for 'non-physical' values of the parameters will be included in the 
acceptance regions of the points near the edge, reducing their limit /3 to have a correct coverage. 

• The value of f3(rn) is larger than the approximated one. This happens when different regions of the 
parameter space give similar predictions for the data. A data point that is included in the acceptance 
region of a given m in the linear approximation may have a bigger p(a;|mbost) because of the folding of 
the hypersurface, which lowers its likelihood ratio. To reach the requested probability a bigger value for 
(3 is needed. 

Within the LMA, LOW and SMA regions the linear approximation is pretty good, as the curvature of the 
'theoretical surface' is small with respect to the typical errors on the rates. The effects due to the edges of the 

is essential to get this good property. Some pathological situations are still possible if the best-fit point does not exist 1211 . 

^It is interesting to note that even if a fundamental property of frequentist inference is its independence from the metric and 
topology of the parameter space, the validity of the Ax'^-cut approximation strongly depends on them. 
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Figure 3: Comparison between 
the CG acceptance region (black) 
and the FC one (red) at 90% 
CL for a parameter point in the 
SMA region. Chlorine and Gal- 
lium rates are in SNU, while 
the SK rate is in 10^ cm~^s~"'^. 



The presence of LMA points with 
comparable predictions makes the 
FC region asymmetric and dis- 
connected. 



theoretical surface can also be neglected. The main deviation from the Ax^ approximation is due to the fact 
that SMA, LMA, LOW points give similar predictions: the surface of theoretical rates is folded. Constructing 
the acceptance region for an oscillation parameter m in the SMA region, we find points with best-fit parameters 
in the LMA region, so that the Ax^ approximation is expected to give some undercoverage. The FC acceptance 
regions at 90% and 99% CL are plotted in figure Q we see that the regions obtained from the Ax^-cut are 
smaller than the exact FC regions. For example, the approximated Ax^ cut at 90% CL is Ax^ < 4.6. The 
value of (3{m) at 90% CL obtained with the FC construction is w (4.6 ~ 5.5) for m in the SMA region and 
~ (4.8 5.5) for m in the LMA region. Furthermore, owing to the variation of logdetcr^ (neglected by the 
Ax^ approximation), in the SMA and LOW regions the FC boundary intersects the Ax^ boundary, instead 
of surrounding it. The difference between the approximate and rigorous methods is however small enough to 
justify the Ax^ approximation. 



A second way to construct confidence regions is based on the CG ordering: the acceptance regions are built 
beginning from the points of highest probability. Such ordering is not invariant under a reparametrization of the 
experimental data (e.g. a CG fit of the rates is different from a CG fit of the squared rates). Such acceptance 
regions are the smallest with the given coverage. The difference between the FC and the CG procedures is 
essential for those points that are unlikely for any value of the parameters, i.e. all the points far from the surface 
of theoretical rates. We have seen that, with the FC ordering, every data point is included in the acceptance 
region of at least one point in the parameter space, but this is obviously not true for the CG method. Consider 
for example the linear approximation in which the theoretical rates describe a plane in the rate space. For a 
given m, in view of (Q, the CG acceptance region will be an ellipsoid centered in the most likely value for the 
rates, which becomes larger and larger with growing CL. The FC acceptance regions will be very different and 
stretched to infinity in a cylindrical shape perpendicularly to the plane (see figureEJ. This is clear if we consider 
that in this approximation the maximum likelihood point mbost is obtained by projecting a data point on the 
'theoretical plane' (in the base where the covariance matrix is proportional to the identity): all the points lying 
on a line perpendicular to this plane and intersecting it in the point described by m have likelihood ratio equal 
to 1 and are included in the acceptance region. 

In figure 13 we compare the FC and CG acceptance regions for one given SMA oscillation. We see that the 
CG region is ellipsoidal as expected, while the FC one is stretched, but in one direction only. This is due to 



2.2 Crow— Gardner fit 
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the strong non-linearity of the 'theoretical surface': LMA and SMA have similar rate predictions even if they 
have very different parameters. The surface is folded and this causes the asymmetry in the FC region: the 
acceptance region is deformed and disconnected to get far from the LMA predictions. 

The CG acceptance regions at 90% and 99% CL are plotted in figure The differences between the FC 
and CG confidence regions are readily understood. If we fix the experimental data and begin from a very low 
CL, we expect an empty CG region (there is no ellipsoid that contains the data) while the FC region is small 
but non empty. As the CL increases, a CG region appears (roughly at 4% CL in our case) and all the regions 
grow. With a large CL we expect the CG regions to be larger than the FC ones as the ellipsoids have a larger 
projection on the 'theoretical surface' than the stretched FC acceptance regions (see fig. EJ- AH these features 
can be checked explicitly in figure 

In conclusion two points must be stressed: first of all we have checked that a correct frequcntist approach 
gives results only slightly different from the naive analysis based on the Ax^-cut. This is apparently in contrast 
to what is obtained in |14j . The main difference between that analysis and ours is that we use all three rates to 
construct confidence regions, while J3] finds, with a Monte Carlo simulation, the distribution of the maximum 
likelihood estimators Arh^ and 9 and construct from this the confidence regions, using the Crow-Gardner 
ordering. Since the two Am^, are not a sufficient statistics for the three rates, this procedure implies a certain 
loss of information, which leads to larger confidence regions. 

A second point is the comparison between the two methods, CG and FC: the results are pretty similar. 
As we will see in the next section, this is no longer the case when the SK data on the angular and energy 
distribution are included in the fit. Wc have not shown 'vacuum oscillation' fits of the solar rates because they 
are strongly disfavoured by the SK data. 

3 The inclusion of the whole data set 

The SuperKamiokande collaboration has also measured the energy spectrum of the recoil electrons as a function 
of the zenith-angle position of the sun. The full data set usually employed in solar neutrino fits contains 38 
independent dof (see the appendix). With such a number of data it is practically impossible to perform a 
complete numerical construction of the acceptance regions, without any approximation. For example, even if 
we divide every dimension of the data space in only 20 cells, we arrive to a 38-dimensional space divided into 
^ 10^° cells. For this reason we cannot construct the FC confidence regions with all the data set. However, 
for the same reasons described in the previous section, the approximated A^^-cut method is expected to be a 
reasonable approximation of the FC construction and to give confidence regions slightly smaller than the FC 
ones. 

For the CG ordering the situation is better. Since we have approximated the pdf as a Gaussian function 
of the data, the CG construction is equivalent to a cut on the with 38 dof (rather than on the Ax^). Note 
that in this case the procedure is exact even if logdettr^ is not constant. For any m the x^"Cut defines an 
ellipsoidal acceptance region A{m), and the confidence region is given by the set of parameters with smaller 
than a given value. The comparison between the CG ordering and the approximated Ax^-cut can be done 
analytically: for any given value of CL 



The comparison between the CG method and the Ax^-cut, which we can consider an approximation of the FC 
ordering, is shown in figure^land presents all the features described in the previous section. The differences are 
now rather evident. The CG regions arc empty until the w 40% CL, while the Ax^ regions arc never empty (as 
the FC ones). The two methods give equal regions for ~ 45% CL. With a larger CL the CG regions are bigger 
than the Ax^ regions. Figure 01 shows that the arbitrariness in constructing frequcntist acceptance regions 
can be quite significant. For example a CG fit accepts a large Am? « 10~'^eV^ at « 75% CL, while the Ax^ 



FC w Ax^-cut : 
CG = x^-cut : 



X 



2(Am2; 0) - xLst < Quantilc(x22 ^of , CL) 
x2(Am2; 0) < Quantile(x238dof. CL). 



(7) 
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Figure 4: Comparison between the Ax^-cut (left), the CG method (middle) and a Bayesian fit done assuming 
a flat prior in the plotted variables (right). We show the 35% (dotted lines), 68% (dashed), 90% (continuous) 
and 99% CL (dot-dashed) regions. 



approximation to the FC fit rules it out at « 98% CL. One should keep in mind the statistical assumptions 
behind this limit, when using it to demonstrate the necessity of a hierarchy between the squared-mass differences 
characteristic of the solar and atmospheric anomalies. We now explain the reason of this difference and argue 
that the FC fit is the relevant one. 

It could seem strange that, after including all data, the CG confidence regions with high CL are much larger 
than when fitting only the rates. This happens because the CG method does not use the information on the 
parameters contained in the data in the most efficient way. Statistical fluctuation leads to experimental results 
that do not lie on the theoretical surface. The CG ordering, treating in the same way the directions in the 
data space 'perpendicular" and 'parallel' to the surface, does not use the information contained in the 'distance' 
from the theoretical surface to obtain further information on the parameters. This is why the CG ordering 
leads to strange results, though correct from the coverage point of view. The FC method uses the 'distance' 
from the theoretical surface to recognize and eliminate the statistical fluctuations that have nothing to do with 
the determination of the parameters. This difference between the two procedures is much more significant 
when fitting the full data set than when fitting the rates only. There are now many more data than unknown 
parameters: the theoretical surface is two-dimensional in a 38-dimensional data space. 

In figure 01 we also show a Bayesian fit, done assuming a flat prior dp{/S.m? , 9) = din Am^ dXnianO. Unlike 
the CG fit. this Bayesian fit should coincide with the Ax^ approximation to the FC fit if the pdf were a Gaussian 
function of In Ato^ and InianO. By comparing the two fits we can again sec visible but not crucial corrections 
due to the non-Gaussianity in the theoretical parameters Am, 9. The arbitrariness in the prior distribution 
function p{/S.m? , 9) gives an uncertainty comparable to the effect due to the non-Gaussianity. 

4 The goodness of fit 

A goodness-of-fit (GOF) test studies the probability of the experimental data p(data|theory) under given theo- 
retical hypotheses (a model of the sun, the assumption that neutrinos are oscillating rather than decaying. . . ) 
leading to statements of the form: if all the hypotheses are true, the probability that the discrepancy between 
predictions and data is due to statistical fluctuations is less than a certain amount. The purpose is to understand 
if the theoretical hypotheses used to explain the data are plausible or not. 

When analyzing the most recent data, one encounters the following paradoxical situation: the LOW solution 
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(a) Rates only 



(6) Rates and spectra: 
naive result 



(c) Rates and spectra: 
refined result 
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Table 1: GOF of the SMA, LMA, LOW and energy-independent solutions obtained by fitting (a) only the 
rates, (b) including all the data, (c) including only the 'most significant' data. The energy-independent solution 
contains only one free parameter, while SMA, LMA, LOW have two free parameters. Our results are compared 
with the corresponding ones in ]2'A \2!A \14i - The symbol < recalls that the GOF values could be slightly lower. 

gives a poor fit of the solar rates only (e.g. [23| finds a GOF of 0.5%). After including the full data set, LOW 
gives a good fit (e.g. finds a GOF of 40%). The paradox is that we have added 35 zenith and energy bins, 
in which there is no signal for neutrino oscillations. It is clearly necessary to understand better the meaning of 
the GOF test before we can decide if LOW gives a decent fit or not. This important question also applies to 
the other alternative solutions. Such tests are based on Pearson's x^' the quantity [20! 

Y.{xr^£f)^^{:^''~xf) (8) 

is asymptotically distributed as a with A'data ~ A^param dof^". The hats indicate that the corresponding 
quantity must be evaluated at the maximum likelihood parameter point. The CG ordering is deeply linked to 
this GOF test: the absence of a confidence region until a given CL value is correlated to the goodness of the fit. 

The paradoxical increase of the GOF of LOW is clearly due to the fact that the Pearson test does not 
recognize that there is a problem concentrated in the three solar rates that contain all the evidence for neutrino 
oscillations. It only sees that the total is roughly equal to the large total number of dof (38), so that the fit 
seems good. According to this procedure, even a global three neutrino fit of solar, atmospheric and LSND data 
would seem good. It is well known that the LSND anomaly requires a fourth neutrino, or something else. 

We now explain why the Pearson's test is not adequate for such a situation. Pearson's test does a precise 
thing: it tests the validity of a certain solution with respect to a generic alternative hypothesis, which has a 
sufficient number of parameters to fit all the data with infinite precision. Therefore the inclusion of more data 
changes also the set of alternative hypotheses which we compare with. Describing the recoil electron spectrum 
in terms of 18 energy bins implies that we admit alternative theories with fuzzy energy spectra. No physical 
mechanism could generate an irregular spectrum, so that we do not want to test this aspect. The measured 
spectrum is of course regular, and Pearson test rewards the LOW solution for this reason. To better understand 
this point, suppose that we add as new data the direction of arrival of the interacting neutrinos. All solutions 
(including the no-oscillation hypothesis) would have a higher GOF. It is obvious that these solutions are much 
better than a generic one, because they at least 'know' where the Sun is. A meaningful test should include 
only those data that really test the hypothesis under consideration. On the contrary, the inclusion of irrelevant 
data does not affect the confidence regions built with the FC ordering, so that a naive application of the Ax^-cut 
correctly approximates the best-fit regions. 

We therefore conclude that testing the goodness of the fit using a lot of energy bins gives a formally correct 
answer to an irrelevant question. If a smaller set of data were used to describe the spectral and angular 
information, the set of alternative hypotheses would be more reasonable and one would conclude that there is a 
goodness-of-fit problem. Most of the information on the energy and zenith-angle spectra can be condensed into 

^"Thc result is exact if the variation of logdctcr^ can be neglected. Furthermore, with a finite number of data, this test is 
exact only with theoretical rates depending linearly on the parameters. The deviation from this approximation leads to a small 
overestimate of the GOF: for example, fitting only the rates, the GOF of SMA gets corrected from 51.8% to 48.4% I14| . 
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observables such as the mean recoil electron energy and the day/night asymmetry, as shown in fits presented 
by the SK collaboration 0j. 

Within our assumption of 2-neutrino oscillations, the main new information encoded in SK spectral and 
day/night data is that the survival probability Pee{E^) can only have a mild energy dependence around E^, ~ 
lOMeV. This can be seen in a simple way by parameterizing Pee{Ei,) as 

The SK spectral data measure Pg = —0.05 ± 0.1 and disfavour the SMA solution because it prefers a larger 
slope Pg. Significant non-linearities in Pee{E^) are not predicted in SMA, LMA, LOW oscillations, nor could 
be recognized by SK (the present error on Pq' is ^ 1). 

In conclusion most of the information contained in the SK spectral data can be conveniently condensed into 
a single observable /, that measures the slope of Pee{E^). One possible choice is the ratio between the rate of 
'low energy' (i.e. Te < 9 MeV) and 'high energy' (i.e. 9 MeV < Tg < 13 MeV) recoil electrons, as measured by 
SK. The upper bound on the recoil electron energy Tg has been chosen in order to avoid potential problems due 
to an enhanced flux of hep neutrinos. The measured value and the uncertainty on f{pi) can be easily deduced 
from the SK data on the full energy spectrum: CTj = fioljfj (in Gaussian approximation), where fi = df /dpi, 

{pi} is the full set of SK bins and ct|j is the full error matrix. 

By supplementing the fit of the total neutrino rates with a single observable / wc find the GOF values 
shown in table 1. The symbol < recalls that, especially in the SMA case, it could be possible to obtain slightly 
lower GOF values by identifying another observable more sensitive to the energy dependence of the neutrino 
survival probability. However, a variation of the GOF between, say, 1% and 4%, is within the uncertainty 
due to arbitrariness inherent in any statistical analysis. The important point is that the GOF values are 
significantly lower than the values based on a naive test, and motivate a non-standard analysis of the solar 
neutrino anomaly [SJ. The SK collaboration [2] finds that SMA now gives a poor fit using another reasonable 
procedure: at 97% CL, the region favoured by total rates falls inside the region disfavoured by spectral and 
day/night data. 

5 Conclusions 

Fits of solar neutrino data are usually done using the A^^-cut valid in the Gaussian approximation and find few 
distinct best-fit solutions (LMA, SMA, LOW, . . . ). Since a Gaussian would have only one peak, it is useful to 
check the validity of the Ax^-cut approximation by comparing its results with the exact confidence regions built 
using the Neyman construction. This has been done using two different ordering prescriptions, proposed by 
Crow-Gardner (CG) and by Feldman-Cousins (FC). We find that the Ax^ cut provides a good approximation 
to the Feldman-Cousins confidence regions. 

When the full data set is used, there is some significant difference between the CG and FC regions. Even if 
the two methods give regions with the same statistical meaning, their conceptual significance is different. The 
FC regions are not influenced by statistical fluctuations with no information on the oscillation parameters, while 
the CG regions are composed by all the oscillation parameters that provide an acceptable fit of the data. The 
meaning of the results is deeply influenced by the assumptions involved in the statistical analysis. We finally 
show that a correct understanding of the meaning of Pearson 'goodness-of-fit' test invalidates the statement 
that all solutions (LMA, LOW and SMA) presently give a good fit. 

We hope that our refined statistical analysis has been useful for clarifying some aspects of solar neutrino 
fits. Only refined experimental data will allow to identify the solution of the solar neutrino problem. We plan 
to update the electronic version of this paper, if new significant experimental data will soon be presented. 
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Figure 5: The 'rates only' fit of fig. ^ updated including the SNO results. Confidence regions at 90% (left) 
and 99% (right) CL obtained from the four solar rates using three different methods: the Ax^ approximation 
(continuous line), the Feldman- Cousins procedure (dashed line) and the Crow- Gardner procedure (dotted line). 



6 Addendum: CC SNO results 

In this addendum, we update our results by adding: 

• the Ved — > ppe (CC) rate measured at SNO [221 above the energy threshold Tes > 6.75 MeV: R^^q ~ 
(1.75 ± 0.13)10*^ cm-2s~i. 

• the latest SK results [201 (1258 days). The total flux now is Rsk ^ (2.32 ± 0.08) • lO^cm-^s"!. The 
day/night energy spectrum contains a new bin with energy 5 MeV < < 5.5 MeV. 

• Furthermore, we explicitly include the CHOOZ data ^Hl in the global fit. CHOOZ data just disfavour 
values of Am^ above 0.7 10~'^eV^ for large mixing 6 ~ 7r/4 and are irrelevant at smaller Am^, 9. 

The updated fit of the four Chlorine, Gallium, SK and SNO rates (without including spectral data) is shown in 
fig- El The Feldman-Cousins SMA region is somewhat smaller than what obtained from the Ax^ approximation: 
this is mainly due to the variation of the determinant of the covariance matrix in the SMA region (see section 2). 
Anyway there is no significant discrepancy between the frequentist fits and the A^^ Gaussian approximation. 

Fig.Elshows the results for the global fit, including all the rates and the day/night spectrum measured by SK. 
The Crow-Gardner (CG) confidence regions are composed by all oscillation parameters with a high goodness- 
of-fit, evaluated in a naive way. As explained in section 3 the CG procedure gives very large confidence regions 
because it is an inefRcient (although statistically correct) procedure. The Feldman-Cousins (FC) procedure 
gives smaller regions because it is more efficient: unlike the CG procedure it recognizes and eliminates the 
statistical fluctuations with no information on the oscillation parameters. In the Gaussian approximation, it 
should agree with the Bayesian fit with fiat prior distribution: the differences between the A^^-approximation 
and the Bayesian are visible in fig.|Slbut not significant. 
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Figure 6: The 'global' fit of fig. ^ updated including the SNO results. Comparison between the the CG method 
(left), the l^x^-cut (middle), and a Bayesian fit done assuming a flat prior in the plotted variables (right). We 
show the 35% (dotted lines), 68% (dashed), 90% (continuous) and 99% CL (dot-dashed) regions. 



How strongly is now the SMA solution disfavoured? Remembering that the difference in the best-fit 
powerful statistical indicator, the main results can be summarized as 




x'(SMA) = x'(LMA)+ <^ THi ^ ' X^LOW) = xlLMA) 
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our fit 


4.5 


ED 


3.1 
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m 



(10) 



where, for comparison, we also report the results of three other post-SNO analyses. Like in |32| and unlike 
in 1^03 we improved the definition of the by using as data the SK flux in the 18 + 18 energy day/night 
bins, rather than separating the SK data into total flux and un-normalized spectrum. This improved takes 
into account statistical correlations between the total SNO and SK rates and the SK spectral data. Due to 
the different energy thresholds in SK and SNO such kind of correlations arc now non negligible^-'^: using the 
improved the difference Ax^ = x^(SMA) — x^(LMA) is reduced^^ by 3. A MonteCarlo simulation would 
be necessary to determine precisely how Ax^ is distributed: presumably (|10|l implies that the evidence for LMA 
against SMA is now somewhere between "3cr" and "4cr" . 

As discussed in section 4, the claims in |32IS31 that the SMA solution (and few other disfavoured solutions) 
still give a 'good fit' is based on an inefficient goodness-of-fit test on the value of some global . In the case 
of parameter determination, the Crow-Gardner procedure is statistically correct (like the global x^ test), but 
no one employs it just because it is inefficient (as clear in fig. EJ. Using the more efficient goodness-of-fit test 
discussed in section 4, we get the updated post-SNO results 

GOF(LMA) - 25%, GOF(LOW) - 4%, GOF(EI) - 1%, GOF(SMA) - 0.4%, 

where EI denotes the 'energy independent' solution obtained for Am^ > 2 • 10~'*cV^. Unlike few years ago LMA, 
LOW or EI oscillations are now better than SMA not because new data gave new significant positive indications 

^'^A different effect that shifts the SMA region by a comparable amount is the variation of the detc^ term in eq.s 0, isl . 

^•^The SMA solution is still present at 99% CL only in the plots in 1321 due to a simpler reason: they used the Ax^ corresponding 
to 3 degrees of freedom (rather than to 2 degrees of freedom as here and in 1311 1331 ) because their analysis was made in a model 
with a fourth sterile neutrino and '2 + 2' mass spectrum, where solar oscillations depend on one more parameter. 
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for them, but because new data (compatible with no spectral distortion) gave a strong evidence against SMA. 
The 'worse' solutions of few years ago are now the 'better' ones, but cannot give very good fits. Of course all 
results in this paper are based on the (sometimes questionable) assumption that all systematic and theoretical 
uncertainties involved in the game have been correctly estimated. The final results do not significantly change 
if the (not yet calibrated) Chlorine result is not included in the fit. 

In order to show in more detail the impact of SNO, 
in fig.[7|wc plot the values of the variation in the ^ due 
to the SNO data: 

Ax^ = X^(post-SNO) - x^(prc-SNO). 

We see again that SNO results perfectly agree with LMA 
and LOW (for the value of the Boron flux predicted by 
solar models) and disfavour the SMA solution. Energy- 
independent solutions are also slightly disfavoured with 
respect to LMA and LOW: neglecting possible small cor- 
rections due to 013, energy-independent oscillations can 
at most convert half of the initial v^. into f^, Vt (for max- 
imal mixing), while SNO and SK indicate that the i'^, Vt 
flux is ^ 1.5 standard deviation higher than the flux. 

Finally, it is useful to recall another important re- 
sult: performing a global analysis including all experi- 
mental data (solar, atmospheric, reactor, short-baseline) 
in models with a fourth sterile neutrino, the best is 
obtained when the sterile neutrino is not used at all (so 
that atmospheric and solar data are explained by oscil- 
lations into active neutrinos, while the LSND data are 

not explained). This means that the present LSND evidence for effects due to a sterile neutrino is weaker 
than the present evidence of other experiments against it. 




Figure 7: Regions 'favoured' (green) and 'dis- 
favoured' (red) by the first SNO data. 



7 Addendum: NC and day/night SNO results 

In this addendum, we update our results by adding: 

• The i^d lypn (NC) and the ppe (CC) rate measured at SNO Pl] during the initial pure D2O phase 

using events with measured Cerenkov energy, T^s, larger than 5MeV. Assuming an energy-independent 
survival probability, < Pee < 1, SNO claims 

-RsNO = l-76±0.06(stat)±0.09(syst) lO^cm^^s^i = $8BPee, (Ha) 
i?^i?o = 5.09±0.44(stat)±0.46(syst) lO^cm-^s-i = $8B(l-77s(l-^'ee)) (Hb) 

(with some mild anti-correlation; < ryg < 1 is the eventual sterile fraction involved in solar oscillations) 
giving a a S.lcr direct evidence for neutrino conversion, -RsnoZ-^sno ^ ^- there are no sterile neutrinos 
{rjs = 0) Rg^Q = $88: confirming the solar model prediction for the Boron flux. Without assuming that 
Pee{Eiy) is energy- independent, the error considerably increases |34| 

^NC^ = 6.4 ± 1.6(stat) ± 0.6(syst) 10^ cm-^s^i. 

In order to make an oscillation fit one needs to extract i?gNo from the SNO data, taking into account the 
energy distortion predicted by oscillations. We do this using the data reported in fig. 2c of j34j^'^. 

^^We here give more details about the procedure we follow. Since SK tells that a significant spectral distortion is not present, in 
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• The day/night asymmetries measm'cd at SNO. Assuming oscillations of only active neutrinos, SNO ex- 
tracts the Vf. asymmetry to be A^^ — 7.0% ± 5.1% [HHl- At Ey ^ 10 MeV earth matter effects are too large 
for Ato^ values between LMA and LOW (that are therefore excluded) and are still present in the lower 
Ato^ range of LMA (here the regeneration effect is larger at high energy and for neutrinos that cross 
a sizable part of the earth) and in the upper Ato^ range of LOW (here the regeneration effect is larger 
at low energy and has a characteristic oscillating zenith angle behavior) Both SK and SNO see a 

1.5(7 hint for a regeneration effect, but neither SK nor SNO data show a statistically significant energy 
or zenith-angle dependence of the effect, that would allow to discriminate LMA from LOW. 

• The latest SK results (1496 days): the full zenith angle and energy distributions have now been 
released, so that SK data now disfavour more strongly oscillations parameters that give rise to detectable 
earth matter effects at SK (the lower part of the LMA solution and the upper part of the LOW solution). 
We use as SK data the rates in the 44 bins, whose content and uncertainties are precisely described in 
tables 1 and 2 of EHl- 

• The latest SAGE result The averaged Gallex/SAGE/GNO rate is now i?Ga = (72.4 ± 4.7) SNU (in 
place of the older value in eq. (jl2b|l 'l. making the LOW solution less disfavoured. 

• According to solar models, the total flux of *B neutrinos is proportional to S'i7(0) (that parameterizes the 
""Bep — > ^B7 cross section at zero energy). New recent measurements 

S'i7(0) = 17.8 ± 1.3 eV b gOl, Si7(0) = 22.3 ± 0.9 eV b gT], Si7(0) = 22.7 ± 1.2 eV b ga 

do not fully agree. Therefore we still use ^^(O) = lOlaeVb, as in BPOO 0. After the SNO NC 
measurement, this arbitrary choice is relevant almost only in fits with sterile neutrinos. 

The result of our global fit is shown in fig. (capabilities of SNO have been discussed in 021 )• The best fit 
lies in the LMA region 

X^(47 dof) = 33.0 at Am^ IQ-^-^eV^ tan^ e ^ -f 0.35. 

The LOW solution begins to be significantly disfavoured, because the precise SNO determination of the ^B flux 
makes more statistically evident that LOW oscillations give a poor fit of the other rates^^: 

Xlow ~ Xlma + 8.1 

(a difference of 10.7 is found in where the full SK zenith- angle/energy spectra are not employed). If LMA 
is the true solution, KaniLand could confirm it within few months. 

Unlike in older fits, the upper part of the LMA region (that in the two-neutrino context we are considering 
gives rise to energy-independent oscillations with Pee > 1/2) is now disfavoured also by solar data alone (the 
inclusion of the CHOOZ constraint now only has little effect), since SNO suggests Pee < 1/2. This can be 
considered as the first hint for MSW effects [Hj. 

Vacuum oscillations predicted spectral distortions not seen by SK. Before SNO, the remaining marginally 
allowed vacuum oscillation solution was 'JustSo^' (with Am^ < 10~^^cV^) that predicted Pee ~ 1 (i-C approx- 
imately no oscillations) at energies probed by SK and SNO. Oscillations with Am^ < 10~^cV^ now give the 
relatively better vacuum oscillation fit; their is larger than in the best LMA fit by about 14. 

order to obtain a precise fit of reasonably allowed solutions we only need to compute the shift in the central value of Rq^q ■ Since 

the NC signal are 7 rays of energy 6.25 MeV (detected through their Cerenkov photons), events with T^ff > 8 MeV are dominantly 
given by CC scattering (plus a small ES component, precisely measured by SK, and also measured by SNO looking at forward 
scattering events, fig. 2a of IMl ). The NC rate is then roughly obtained by subtracting the predicted CC rate from the number of 
events with r^ff < 8 MeV. Since most (about 2/3) of these events are due to CC scattering, a small spectral distortion in the CC 
spectrum can give a sizable shift of the NC rate. Including this effect the for a- typical LMA (LOW) (SMA) oscillation increases 
by 1/2 (0) (3) with respect to a fit naively done using cq.s tm . In presence of significant sterile effects, our approximate procedure 
could be not accurate enough if the NC efficiency at SNO strongly depends on E,j. We use the NC and CC cross sections from I3fi| . 

^''Thc SAGE/Gallex/GNO rate is higher than what predicted by LOW oscillations. It will be interesting to see the next GNO 
data. 
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Figure 8: Best fit regions at 90, 99, 99.73% CL assuming <-> v^^t oscillations. In fig. 03 we fit the CI, Ga 
rates, the SK day/night and zenith angle spectra, the SNO NC and CC rates and day/night asymmetries and 
the CHOOZ energy spectrum, assuming the BP 00 ^ solar model. In fig. \^ we drop the BPOO prediction for 
the Boron and Beryllium solar fluxes. 



In fig. |S|d we show the resuhs of a fit where we do not mclude the BPOO |S] predictions for the Boron and 
Berylhum fiuxes: the best-fit regions do not significantly change. From the point of view of osciUation analyses, 
the Berylhum flux is now the most relevant, not fully safe, solar model prediction (see [U for a discussion): 
according to BPOO [21 the percentage error on the Beryllium fiux is only two times smaller than the one on the 
Boron flux. Since Beryllium and Boron neutrinos are generated by different scattering processes of ^Be, the 
agreement of the Boron fiux with SNO data suggests (but does not imply) a Beryllium flux also in agreement 
with solar models. 

We now explain in simple terms which data eliminate the other solutions allowed in older fits. 
Autopsy of no oscillations 

The NC and CC data from SNO alone exclude this case at 5.1(t. The statistical significance of the solar neutrino 
anomaly increases to Sti, according to our global fit. 

Autopsy of small mixing angle oscillations 

For small mixing angle and in the SK and SNO energy range, i?^ ~ (5 15) MeV, the survival probability is 
well approximated by the Landau-Zener level-crossing probability. Pee ~ e^^l^" , where is a constant (fixed 
by the oscillation parameters and by the solar density gradient). In order to get Pee ~ 1/2 as demanded by 
SNO and SK one needs to choose E ~ 10 MeV. This implies dPee/ d\nEy ~ 1, in contrast with the undistorted 
energy spectrum observed by SK (see eq. ©) and SNO. Performing a global fit we find 

X^(SMA) = x^(LMA) + 23 
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Figure 9: Fig. Oi; best-fit regions of the Boron flux and of the fraction rjs of sterile neutrino involved in solar 
oscillations at 90 and 99% CL (2 dof). Fig.\^: fraction of sterile neutrino (1 dof) allowed in solar (continuous 
lines) and atmospheric (dashed line) oscillations: in both cases data prefer no sterile neutrino. 



i.e. a > 4(7 evidence for LMA versus SMA. As explained in section 4, the fact that x^/dof « 1.2 for the SMA 
solution does not mean that SMA oscillations give a good fit: the Pearson's test is not efficient in recogirizing 
bad fits when there are many dof. A more efficient test (see sections 4 and 6) confirms that SMA has a low 
goodness-of-fit, as expected. 

Autopsy of pure sterile oscillations 

Assuming no distortion of the energy spectrum of Boron neutrinos, SNO sees a S.lcr evidence for v^^^ appearance. 
This evidence decreases down to Scr allowing for generic spectral distortions, but significant spectral distortions 
are not allowed by SK. The sterile SMA solution (that gave the best sterile fit, before SNO) gives a spectral 
distortion that increases the SNO evidence for appearance, and is therefore excluded in two different ways. 
In our global fit we find 

X^(LMA sterile) = ^^(LMA active) + 29. 
SMA and LOW provide sterile fits slightly worse than LMA. 

Autopsy of '2+2' oscillations 

The main motivation for adding a sterile neutrino, i^g, is provided by the LSND anomaly. The extra neutrino 
can then be used to generate either the solar or atmospheric or LSND oscillations (so that active neutrinos can 
generate the remaining two anomalies). Only the last possibility is now not strongly disfavoured. In fact, taking 
into account that SK and MACRO atmospheric data give a Gcr evidence for v^^ versus v^J^ — > v^^ |45j , it is no 

longer possible to fit the solar or atmospheric evidences (or one combination) with one sterile neutrino: '2+2' 
schemes predict that the fraction of sterile involved in solar oscillations, plus the fraction of sterile involved in 
atmospheric oscillations, adds to one |44| . but both solar and atmospheric data refuse sterile neutrinos. In fig.l^Ji 
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we show how SK/SNO rates and solar model predictions for the Boron flux constrain the sterile fraction in solar 
oscillations. In fig. ^jp wc show how strongly data favor ?7s ~ in both solar and atmospheric oscillations. The 
'number of standard deviations' in fig. I^js is computed as = [x^iVs) ~ X^iVs = 0)]^^^, where is minimized 
with respect to the Boron flux. Atmospheric bounds have been compiled from (naively interpolating the few 
points given in |45|). where significant but unpublished data are employed. Solar bounds have been computed 
from SNO data alone (assuming an energy-independent Pee)i and from a global fit of all solar data. The result 
is not significantly different in the two cases, and depends on solar model uncertainties (e.g. the choice of 5*17)^^. 

We remark that the bound on the sterile fraction holds in the case of active/sterile mixing at a Am^ ^ 
10~'^eV^, as suggested by the LSND anomaly. Active/sterile mixing with two small Am^ (e.g. one Am^ around 
10~'^eV^ and the other around lO^^^eV^) could still give large effects at sub-MeV neutrino energies. 

Acknowledgements We thank Carlos Peha-Garay, Andrea Gambassi, Alberto Nicolis and Donato Nicolo for 
useful discussions. Wc thank Y. Suzuki for the unpublished SuperKamiokande data. 



A Details of the computation 

The energy spectra for the independent components of the solar neutrino flux have been obtained from 0. 
The neutrino production has been averaged for each flux component over the position in the sun as predicted 
in [3IH1- This averaging does not give significant corrections. MSW oscillations inside the sun have been taken 
into account in the following way. The 3x3 density matrix ps for neutrinos exiting from the sun is computed 
using the Landau-Zener approximation with the level-crossing probability appropriate for an exponential density 
profile [HJ |7] . The density profile has been taken from and is quasi-exponential: small corrections to ps have 
been approximately included. Oscillation effects outside the sun are described by the evolution matrix U, so that 
at the detection point pE = U psU'^ . In particular, earth regeneration effects have been computed numerically 
using the mantle-core approximation for the earth density profile. We have used the mean mantle density 
appropriate for each trajectory as predicted by the preliminary Earth model |25j . The detection cross sections 
in Gallium and Chlorine experiments have been taken from [S], performing appropriate interpolations. We have 
used the tree-level Standard Model expression for the neutrino/electron cross section at SK. 

The total neutrino rates measured with the three kinds of experimental techniques are El 

i?ci|cxp = (2.56 ± 0.22) SNU (12a) 
i?Ga|cxp - (74.7 ±5) SNU (12b) 
i?SK|oxp = (2.40 ± 0.08) • 10^ cm^^s-i (12c) 

where SNU = 10"'^^ interactions per target atom and per second. We have combined systematic errors in 
quadrature with statistical errors. The probability distribution function p(data|theory) is computed using 
the covariance matrix described in [HIES- Around the best-fits, it agrees well with the simpler pdf used in 
The experimental energy resolution at SK has been taken into account as sugg csted in ISEIEII- 

The solar-model-independent SK data included in the fit are the energy spectrum of the recoil electrons 
measured separately at SK during the day and during the night. Each energy spectrum |28| is composed of 17 
energy bins between 5.5 and 14MeV, plus one bin between 14 and 20MeV. For these data we have used the 
pdf suggested by the SK collaboration j2Hl , and described in |2ZI for a slightly different set of data. 

The results presented in the addenda have been obtained using updated data and a slightly improved fitting 
procedure, as described in section 6 and 7. 

^^If LMA is the right solution, it will be possible (assuming CPT-invariance) to put safe bounds on the sterile fraction when 
KamLand will have measured the oscillation parameters 9 and Am^ using reactor Pe- This issue has recently been precisely 
studied 1461 . If LOW were the right solution, solar-model-independent bounds on the sterile fraction can be derived from Borexino 
(and maybe KamLand) data about earth matter effects in Beryllium neutrinos. In fact, the oscillation length in matter is roughly 
proportional to 1/(1 — »?b/2), assuming that the earth density is known. 
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8 Addendum: first KamLAND results 



In this addendum, wc update our results by adding the SBtSOL^ ISS recently announced by KamLAND [J. We 
also update our previous analysis including the most recent data from GNO and the full set of spectral and 
day/night SNO data. We here list the set of data included in our fit. 



• Reactor anti-neutrino data (27 bins): 

— The KamLAND data, divided in 13 energy bins with prompt energy higher than 2.6 MeV ^.^^ 

— The CHOOZ data, divided in 14 bins and fitted as in analysis "A" of [S]. 

• Solar neutrino data (80 bins): 

— The SNO data [H], divided in 34 bins (17 energy bins times 2 day/night bins).^'' 

— The SuperKamiokande data divided in 44 zenith-angle and energy bins. 

— The Gallium rate [HI i?Ga = (70.4 ±4.4) SNU, obtained averaging the most recent SAGE, Gallex and 
GNO data. 

— The Chlorine rate Rci = (2.56 ± 0.23) SNU. 

• Solar model predictions. We assume the BPOO solar model ^H] and its estimates of uncertainties."'^* 

• Earth matter effects are computed in the mantle/core approximation, improved by using the average 
density appropriate for each trajectory as predicted in |11| . 



^^The energy spectrum of i/e emitted by a nuclear reactor can be accurately approximated as |3] 

W 



dn 
dE 



fi cxp(aoi -I- auE + a2iE^) 



where E = i?„/MeV. The sums run over the isotopes, i.j = {-^^^U, -^^^Pu, -^^^U, ^*^Pu}, which fissions produce virtually all the 
total thermal power W. Their relative abundancies fi, and the other numerical coefficients are [Tl l^ 



Isotope 


235 u 


239pu 


238 u 


241 Pu 


Relative abundancy fi 


56.8% 


29.7% 


7.8% 


5.7% 


Energy per fission Ei in MeV 


201.7 


205.0 


210.0 


212.4 


ao 


0.870 


0.896 


0.976 


0.793 


ai 


-0.160 


-0.239 


-0.162 


-0.080 


0-2 


-0.091 


-0.0981 


-0.079 


-0.1085 



i>e are detected using the reaction v^P — * efi H]. Using a scintillator, reactor experiments can see both the 7 ray emitted when 
the neutron n is captured, and the scintillation emitted by the positron e as it moves and finally annihilates with an e. The total 
measured energy is Evis = -|- "le , and E^ is related to the neutrino energy by the kincmatical relation Ei, = E^ + Kn + rUn — nip 
(where Kn is the small neutron kinetic energy; A.S. thanks F. Vissani for help in including it in our numerical code for the cross- 

MeV E]. The KamLAND scintillator contains Np = 3.46 lO^i free protons 

r), the number of neutrino events in 



section a). The energy resolution is = 7.5%\/-£'vis 
in its fiducial volume. Summing over all reactors r (that emit a power Wr from a distance 



any given range of E^ 



/ dE^ Pee{dr,E^)a(E^)Np 



dur/ dE,j 

iTvdl 



dE^ 



+ MeV)'^/2o-J 



2naE 



Neglecting small differences between the energy spectra of different reactors (i.e. assuming that all reactors have the same relative 
isotope abundancies fi), the spectrum of detected neutrinos n can be conveniently rewritten as n = no (Pee) where no is the rate 
expected in absence of oscillations and (Pee) is the survival probability, appropriately averaged over energy and distance. Assuming 
an energy-independent suppression, we find Pee = 0.625 it 0.094, in agreement with the more accurate KamLAND result. For 
recent related papers see |^. 

^^SNO rates receive contributions from CC, NC, ES and background events. SNO extracts from all data (energy and zenith-angle 
spectra) the CC, NC and ES components. We instead extract the CC, NC components from the energy spectrum, assuming the 
standard relation <1>es ~ ^CC -(-0.15<1?nc (the ES rate has also been accurately measured by SK). Assuming energy-independent 
oscillations between active neutrinos, our reanalysis of SNO data gives 



■ <I>s 



(5.2 ± 0.5) lO^/cm^s, $cc = -Pee^s 



(1.76 ±0.08) lO'^/cm^s. 



only slightly different from the corresponding SNO analysis |^ (the errors are somewhat anti-correlated) . 

l®In particular, we take into account the uncertainty on the energy spectrum of neutrinos, that affects in a correlated way 
SK, SNO and the other solar experiments. 
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Figure 10: Best-fit regions at 90, 99 and 99.73% CL obtained fitting (a) solar v data; (h) reactor v data (that 
do not distinguish 9 from 11/2 — 9); (c) all data (we find no other allowed regions at the considered CL outside 
the plotted range). 

Wc fit these data assuming v^, v^^t neutrino oscillations with a single Am^ and a mixing angle 9. The total 
is obtained by summing the contribution from solar v data, x^, with the contribution from reactor v data, 
= — 21n£ (the likelihood C is computed employing when appropriate the Poissonian distribution). Best-fit 

regions are computed in Gaussian approximation, cutting the A^^ at the value appropriate for 2 degrees of 

freedom. 

The result of our fit is shown in fig. ^| The best fit point (marked with a dot) and the la ranges for the 
single parameters (1 dof) are 

Am^2 = (7-0 ± 0.6) 10^^eV^ tan^ 6*12 = 0.45 ± 0.06. 

At the best fit point = 72.9 (80 bins) and xicamLAND ~ ^ bins^^). The disfavoured local minimum 
at higher Am^ = 1.53 10~^eV^ shown in fig. llOb ('HLMA' solution) has a 6.4 higher than in the best fit 
solution. According to our global fit, the combined evidence for a solar neutrino anomaly is almost lOcr; the 
evidence for LMA versus SMA is about 6cr; the QVO and LOW solutions arc disfavoured at about (4 ~ 5)(T. 
According to a naive Pearson x^ test these excluded solutions still have an acceptable goodness-of-fit, around 
20% (this happens because the x^ test has a limited statistical power when many data are fitted together, as we 
explained in section 4) while our more efficient test recognizes that their goodness-of-fit probability is ^ 10"'^. 
More importantly, present data are well described by LMA oscillations and do not contain hints for something 
more. 

KamLAND data have an important impact also on various related issues: 

• Maximal mixing? Fitting reactor data alone ffig.llObV we find the best fit point at Arn^ = 6.9 lO^^eV^, 
sin^ 29 = 0.99 but also a larger uncertainty on the mixing angle 9. Therefore maximal mixing remains 
disfavoured by solar data. In the global fit of fig. llOb . maximal mixing would enter the best-fit regions 
only at 99.96% CL (2 dof). Before KamLAND. maximal mixing was allowed in Q(VO) solutions at a 
reasonable CL. 

• Oscillations? If the present trend is confirmed by future KamLAND data, all proposed alternatives 
to oscillations will be excluded. If Am^<2 lO^^'eV^, with more statistics KamLAND can see a clean 
oscillation pattern, giving a precise measurement of Am^ This will tell if earth-matter effects can 
be observed by a larger SK-like solar detector. At the best-fit point the total day/night asymmetry is 
predicted to be 2.7% at SK and 4.5% in CC events at SNO, comparable to present sensitivities. 

• Future experiments. According to the present oscillation best-fit point, Borexino should observe no 
day/night asymmetry, no seasonal variation and a 65% reduction of the ''Be flux. If this will happen, solar 

^®We have normalized the likeUhood such that C = 1 when data agree with theorcthical predictions, and ignored all other issues 
related to the difference between Poissonian and Gaussian distributions. 
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experiments alone will tell that LMA is the unique oscillation solution, confirming the CPT theorem. The 
impact of KamLAND data on our knowledge of solar neutrinos will be discussed in an update to |12j . 
Knowing that LMA oscillations are the solution to the solar neutrino anomaly allows to plan which 
experiment (s) will give the most precise measurement of Am^ and 9 (see ^-^id ^ future update). 

• Three flavours. When analyzed in a three-neutrino context, solar and reactor experiments are also 
affected by atmospheric oscillations if ^13 7^ 0. For the observed value of Am^^jj^, CHOOZ gives the 
dominant upper bound on ^13. Therefore setting 61^ = is an excellent approximation for fitting solar 
data. 

• Neutrino-less double-beta decay. Assuming hierarchical neutrinos mi ^ m2 ^ the ee element of 
the neutrino mass matrix probed by 0v2(3 decay experiments can be written as |mee| = \'rrf^ + e'"m|*™| 
where a is an unknown Majorana phase and the 'solar' and 'atmospheric' contributions can be predicted 
from oscillation data. Including KamLAND data, the solar contribution to \mee \ is m-l™ = (2.6±0.4) meV. 
The CHOOZ bound on ^13 implies m|*'° < 2 meV at 99% CL. 

• Sterile? Experimental bounds on a possible sterile neutrino component involved in solar oscillations 
arc not yet significantly affected by KamLAND data, that instead have a strong impact on bounds from 
big-bang nucleosynthesis Such bounds cannot be avoided by modifying cosmology before neutrino 
decoupling and are usually presented in the (0, Ato^) plane assuming pure sterile oscillations. It is now 
more interesting to know the bound on a possible subdominant sterile component involved in LMA solar 
oscillations. We estimate that this bound is competitive with the direct experimental bound. 

• LSND. Solutions to the LSND anomaly based on extra sterile neutrinos are now disfavoured by other 
experiments 0]. The CPT- violating neutrino spectrum proposed in )16) allowed to fit all neutrino data, 
but predicted no effect in KamLAND. A slightly different CPT- violating solution mentioned in can 
give a suppression in KamLAND at the price of an up/down asymmetry of atmospheric muon neutrinos 
A « 20%, somewhat smaller than the measured value. This possibility is more disfavoured than '3+1 
oscillations' and less disfavoured than '2+2 oscillations', as will be discussed in an update to 0]. 
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Figure 11: Best-fit regions at 90, 99 and 99.73% CL obtained fitting (a) solar v data; (h) reactor v data (that 
do not distinguish 9 from 7r/2 — 9); (c) all data. These fits can he directly compared with the previous ones, 
shown in fig. \1(A 

9 Addendum: SNO data with enhanced NC sensitivity 

SK detects solar v using water as target. SNO now employs heavy salt water Heavy water allowed to 
study the NC reaction i'e,fi,Td i'e,^i,Tn-p. Salt allows to tag the n with higher efficiency (producing multiple 
7 rays from neutron capture) and therefore a more accurate measurement of total solar i'e,ii,T flux. However, 
the enhanced NC signal makes more difficult to study the CC energy spectrum. As a consequence the recently 
released 'salty' SNO data ^ can be conveniently summarized as a measurement of the total CC, NC and electron 
scattering (ES) neutrino fluxes. Uncertainties on the Boron flux, its energy spectrum, and cross sections are 
correlated with other experiments. We fit latest SNO data as suggested by the SNO collaboration (this means 
that we are slightly less accurate than the SNO collaboration itself). We employ the most recent value of the 
Gallium rate (2]. 

The result of our fit is shown in fig. 1111 where the best fit point is marked with a dot. The la ranges for 
the single parameters (1 dof) are 

Am?2 = (7-2 ± 0.7) 10-^eV^ tan^ 9i2 = 0.44 ± 0.05. 

At the best fit point xfoiar ~ ^5.0 (83 bins) and XxatnLAND ~ ^- ^^'^ total evidence for a solar anomaly is 12(t 
(dominated by solar experiments). We now discuss some consequences of latest SNO data: 

• Oscillations? LMA solutions with larger Am^ (present in our previous fit of fig. llOb 'l have been dis- 
favoured, because they predict a too large survival probability. This is important because only if 
Am^ < 2 lO-^eV^ KamLAND should be able, with more statistics, to see in the energy spectrum 
the typical spectral distortion produced by oscillations and to measure Am^ accurately So far no 
experiment has been able of seeing such a direct signal of neutrino oscillations. 

• Maximal mixing is disfavoured at 5.7(t (i.e. its is 5.7^ higher than the best LMA fit). 

• Symmetry? tan^ 6* = 1/2 (which together with maximal v^^ Vr atmospheric oscillations gives a highly 
symmetric neutrino flavour composition, with |(i^e./^.r 1*^2)1^ = 1/3) is 1.2a above the best fit value. 
Hopefully, future data will test this special value. 

• LMA. Omitting KamLAND D data and fitting only solar u experiments, the LOW solution is now 
disfavoured at 3.3a: Q.5a more than without SNO salt data. Other non-LMA oscillation solutions have 
now been excluded by solar data only. 

• MSW eflfect. SNO gives a 5a evidence for a survival probability less than 1/2. This can be explained 
by MSW enhancement of neutrino oscillations |H], while averaged vacuum oscillations of two- neutrinos 
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can only give Pee = 1 ~ 5 sin'^ 29 > 1/2 (see also 0]). Three-neutrino mixing can only marginally reduce 
Pee, since SK and SNO imply that ^13 is small. 

• Three flavours. When analyzed in a three-neutrino context, solar and reactor experiments are also 
affected by atmospheric oscillations if ^13 ^ 0. For the observed value of Am^j,-,-^, CHOOZ gives the 
dominant upper bound on ^13. ^13 = remains an excellent approximation for fitting solar data even 
taking into account that a recent SK analysis slightly reduces Arri^^^, thereby enlarging the combined 
CHOOZ and SK range of 6*13 from sin^ 26I13 = ± 0.065 to ± 0.085. (See also 0). 

• Neutrino-less double-beta decay experiments probe the ee element of the neutrino mass matrix. Newer 
SNO data (and the revised bounds on ^13) slightly shift the range of |mee| compatible with oscillations 0. 
Assuming hierarchical neutrinos ini 7712 <C and proceeding as in [H| we get 

\mee\ = (0.8 4.2) mcV at 90% CL and \mee\ = (0 5.3) meV at 99% CL. 
In the case of inverted hierarchy (i.e. <C mi sa 7712, or Arn^^ < 0) the updated range is 

\mee\ = (15 ^ 57) meV at 90% CL and \mee\ = (12 ^ 61) mcV at 99% CL. 

Latest SNO data imply a slightly stronger bound on the mass rrii, of quasi-degenerate neutrinos |^: 

< 0.9 (1.2) /leV at 90 (99%) CL. 

This bound is based on the fact that a large is compatible with the Heidelberg-Moscow upper bound 
on \mee\ only if 612 ~ 7r/4. The factor /i « 1, precisely defined in parameterizes the uncertainty in 
the 0i/2(3 ^^Ge nuclear matrix element. 

• Robustness. A few years ago global analyses were the only way of extracting the main features of the 
solar neutrino anomaly from the data. These analyses required a lot of work: a careful simulation of 
the sun, of the neutrino flavour evolution in the sun, in vacuum and in the earth, a careful estimation of 
all possible sources of systematic and theorethical errors. Global fits favoured the LMA solution before 
KaniLAND confirmed it. 

Present fits are dominated by recent data. Their interpretation is based on few key inputs: 1) the energy 
spectrum of Boron neutrinos, which essentially follows from kinematics. 2) matter corrections to neutrino 
propagation in the sun, which reduce the i^e survival probability by a 0(1) factor: from Pee = 1 ~ 5 sin'^ 2d 
at El, < MeV (as in averaged vacuum oscillations) to Pee ~ sin^ 6 (adiabatic MSW resonance) at energies 
probed by SK and SNO. Our results are obtained by running a precise code, but a good approximation 
could be obtained using a simplified code. 
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Figure 12: Best-fit regions at 90, 99 and 99.73% CL. Fig. I j|jb assumes CPT invariance and combines solar 
V data (dashed red contours) with reactor v data (dotted blue contours). The result can be directly compared 
with the previous one, shown in fig. Fig. \lSb shows the separate fit for Am^ in neutrinos and Am^ in 
anti-neutrinos, marginalized with respect to mixing angles. 

10 Addendum: final SNO 'salt' data and KamLAND 

We update our previous results including: the final SNO 'salt' phase results KamLAND data after 766.3 
ton-yr exposure the final Gallcx result and the latest SAGE result The result of our oscillation fit is 
shown in fig. 1121 where the best fit point is marked with a dot. The la ranges for the single parameters (1 dof) 
are 

Am?2 = (8-0±0.3) 10"^eV^ tan^ = 0.45 ± 0.05. (13) 
The total evidence for an effect is now about 12a in solar data and about 6a in KamLAND data. 

We remark that our results are based on a careful global fit, that was necessary to interpret the data available 
a few years ago. But present data speak by themselves, and a simple approximate analysis is sufficient to get 
results practically equivalent to the full ones in cq. H13|) . Indeed Amf 2 is directly determined by the position of 
the oscillation dip at KamLAND, with negligible contribution from all other experiments.^*^ The mixing angle 
is directly determined by the SNO measurements^^ 

(Pee) = $(l^e)/3'(i^e,M,r) = 0.35 ± 0.03. 

In the adiabatic limit LMA predicts (Pee) — sin^ 9. Well known analytic formulae allow to take non-adiabaticity 
into account, obtaining in the energy range explored by SNO: 

(Pee) « 1.15 sin^ 6*12 so that tan^ 6*12 = 0.44 ± 0.06 

which agrees with the results of the global analysis in eq. H13|) , both in the central value and in its uncertainty 
Global fits remain still useful for testing if the pieces of data not included in our simplified analysis contain 
statistically significant indications for new physics beyond LMA oscillations. At the moment the answer is no. 
E.g. fig. 112b shows the update of the CPT- violating solar fit of To really conclude, the experimental results 
auspicated in the conclusions of our first version have been achieved, removing the motivation for our analyses. 
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"This will be rigorously true in the future. For the moment solar data are needed to eliminate spurious solutions mildly disfavored 
by KamLAND data, as illustrated in fig. I12fe ,. 

^^Taking into account the SK measurement ^{ue) -I- 0.1553>(t'e,(/,T) = (2.35 it 0.06) 10^/cm^ sec and the solar model prediction 
^(I'e./i.T-) = (5.05 ± 0.86) 10®/cm^ sec would only marginally improve the measurement of (Pee)- 
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